Search CORE

85 research outputs found

Analysis of a data matrix and a graph: Metagenomic data and the phylogenetic tree

Author: Purdom Elizabeth
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2011
Field of study

In biological experiments researchers often have information in the form of a graph that supplements observed numerical data. Incorporating the knowledge contained in these graphs into an analysis of the numerical data is an important and nontrivial task. We look at the example of metagenomic data---data from a genomic survey of the abundance of different species of bacteria in a sample. Here, the graph of interest is a phylogenetic tree depicting the interspecies relationships among the bacteria species. We illustrate that analysis of the data in a nonstandard inner-product space effectively uses this additional graphical information and produces more meaningful results.Comment: Published in at http://dx.doi.org/10.1214/10-AOAS402 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

CiteSeerX

Slingshot: cell lineage and pseudotime inference for single-cell transcriptomics.

Author: Das Diya
Dudoit Sandrine
Fletcher Russell B
Ngai John
Purdom Elizabeth
Risso Davide
Street Kelly
Yosef Nir
Publication venue: eScholarship, University of California
Publication date: 01/01/2018
Field of study

BackgroundSingle-cell transcriptomics allows researchers to investigate complex communities of heterogeneous cells. It can be applied to stem cells and their descendants in order to chart the progression from multipotent progenitors to fully differentiated cells. While a variety of statistical and computational methods have been proposed for inferring cell lineages, the problem of accurately characterizing multiple branching lineages remains difficult to solve.ResultsWe introduce Slingshot, a novel method for inferring cell lineages and pseudotimes from single-cell gene expression data. In previously published datasets, Slingshot correctly identifies the biological signal for one to three branching trajectories. Additionally, our simulation study shows that Slingshot infers more accurate pseudotimes than other leading methods.ConclusionsSlingshot is a uniquely robust and flexible tool which combines the highly stable techniques necessary for noisy single-cell data with the ability to identify multiple trajectories. Accurate lineage inference is a critical step in the identification of dynamic temporal gene expression

Directory of Open Access Journals

eScholarship - University of California

Archivio istituzionale della ricerca - Università di Padova

Transcription restores DNA repair to heterochromatin, determining regional mutation rates in cancer genomes

Author: Cho Raymond J.
Fudem Gary M.
Purdom Elizabeth
Zheng Christina L.
Publication venue: eScholarship@UMassChan
Publication date: 20/11/2014
Field of study

Somatic mutations in cancer are more frequent in heterochromatic and late-replicating regions of the genome. We report that regional disparities in mutation density are virtually abolished within transcriptionally silent genomic regions of cutaneous squamous cell carcinomas (cSCCs) arising in an XPC(-/-) background. XPC(-/-) cells lack global genome nucleotide excision repair (GG-NER), thus establishing differential access of DNA repair machinery within chromatin-rich regions of the genome as the primary cause for the regional disparity. Strikingly, we find that increasing levels of transcription reduce mutation prevalence on both strands of gene bodies embedded within H3K9me3-dense regions, and only to those levels observed in H3K9me3-sparse regions, also in an XPC-dependent manner. Therefore, transcription appears to reduce mutation prevalence specifically by relieving the constraints imposed by chromatin structure on DNA repair. We model this relationship among transcription, chromatin state, and DNA repair, revealing a new, personalized determinant of cancer risk

eScholarship@UMMS

Error Distribution for Gene Expression Data

Author: Elizabeth Purdom
Susan P Holmes
Publication venue: 'Walter de Gruyter GmbH'
Publication date
Field of study

Crossref

Recommended from our members

Strong succession in arbuscular mycorrhizal fungal communities.

Author: Coleman-Derr Devin
Dahlberg Jeffery
Gao Cheng
Hollingsworth Joy
Hutmacher Robert
Lemaux Peggy
Madera Mary
Montoya Liliam
Purdom Elizabeth
Taylor John
Xu Ling
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

The ecology of fungi lags behind that of plants and animals because most fungi are microscopic and hidden in their substrates. Here, we address the basic ecological process of fungal succession in nature using the microscopic, arbuscular mycorrhizal fungi (AMF) that form essential mutualisms with 70-90% of plants. We find a signal for temporal change in AMF community similarity that is 40-fold stronger than seen in the most recent studies, likely due to weekly samplings of roots, rhizosphere and soil throughout the 17 weeks from seedling to fruit maturity and the use of the fungal DNA barcode to recognize species in a simple, agricultural environment. We demonstrate the patterns of nestedness and turnover and the microbial equivalents of the processes of immigration and extinction, that is, appearance and disappearance. We also provide the first evidence that AMF species co-exist rather than simply co-occur by demonstrating negative, density-dependent population growth for multiple species. Our study shows the advantages of using fungi to test basic ecological hypotheses (e.g., nestedness v. turnover, immigration v. extinction, and coexistence theory) over periods as short as one season

eScholarship - University of California

clusterExperiment and RSEC: A Bioconductor package and framework for clustering of single-cell and other large gene expression datasets

Author: Das Diya
Dudoit Sandrine
Fletcher Russell
Ngai John
Purdom Elizabeth
Purvis Liam
Risso Davide
Publication venue
Publication date: 01/01/2018
Field of study

Clustering of genes and/or samples is a common task in gene expression analysis. The goals in clustering can vary, but an important scenario is that of finding biologically meaningful subtypes within the samples. This is an application that is particularly appropriate when there are large numbers of samples, as in many human disease studies. With the increasing popularity of single-cell transcriptome sequencing (RNA-Seq), many more controlled experiments on model organisms are similarly creating large gene expression datasets with the goal of detecting previously unknown heterogeneity within cells. It is common in the detection of novel subtypes to run many clustering algorithms, as well as rely on subsampling and ensemble methods to improve robustness. We introduce a Bioconductor R package, clusterExperiment, that implements a general and flexible strategy we entitle Resampling-based Sequential Ensemble Clustering (RSEC). RSEC enables the user to easily create multiple, competing clusterings of the data based on different techniques and associated tuning parameters, including easy integration of resampling and sequential clustering, and then provides methods for consolidating the multiple clusterings into a final consensus clustering. The package is modular and allows the user to separately apply the individual components of the RSEC procedure, i.e., apply multiple clustering algorithms, create a consensus clustering or choose tuning parameters, and merge clusters. Additionally, clusterExperiment provides a variety of visualization tools for the clustering process, as well as methods for the identification of possible cluster signatures or biomarkers. The R package clusterExperiment is publicly available through the Bioconductor Project, with a detailed manual (vignette) as well as well documented help pages for each function.</div

Directory of Open Access Journals

eScholarship - University of California

Archivio istituzionale della ricerca - Università di Padova

FigShare

Recommended from our members

Fungal community assembly in drought-stressed sorghum shows stochasticity, selection, and universal ecological dynamics.

Author: Coleman-Derr Devin
Dahlberg Jeffery A
Gao Cheng
Hollingsworth Joy
Hutmacher Robert B
Lemaux Peggy G
Madera Mary
Montoya Liliam
Purdom Elizabeth
Singan Vasanth
Taylor John W
Vogel John
Xu Ling
Publication venue: eScholarship, University of California
Publication date: 01/01/2020
Field of study

Community assembly of crop-associated fungi is thought to be strongly influenced by deterministic selection exerted by the plant host, rather than stochastic processes. Here we use a simple, sorghum system with abundant sampling to show that stochastic forces (drift or stochastic dispersal) act on fungal community assembly in leaves and roots early in host development and when sorghum is drought stressed, conditions when mycobiomes are small. Unexpectedly, we find no signal for stochasticity when drought stress is relieved, likely due to renewed selection by the host. In our experimental system, the host compartment exerts the strongest effects on mycobiome assembly, followed by the timing of plant development and lastly by plant genotype. Using a dissimilarity-overlap approach, we find a universality in the forces of community assembly of the mycobiomes of the different sorghum compartments and in functional guilds of fungi

eScholarship - University of California

Transcriptomic analysis of field-droughted sorghum from seedling to maturity reveals biotic and metabolic responses.

Author: Baker Christopher R
Blow Matthew J
Cole Benjamin
Coleman-Derr Devin
Dahlberg Jeffery
DeGraaf Stephanie
Gao Cheng
Harrison Maria J
Hollingsworth Joy
Hutmacher Robert
Jansson Christer
Jeffers Tim
Lemaux Peggy G
Madera Mary
Niyogi Krishna K
O'Malley Ronan C
Owiti Judith A
Patel Dhruv
Pierroz Grady
Purdom Elizabeth
Sievert Julie
Singan Vasanth R
Taylor John W
Varoquaux Nelle
Visel Axel
Vogel John P
Xu Ling
Yoshinaga Yuko
Publication venue: eScholarship, University of California
Publication date: 01/12/2019
Field of study

Drought is the most important environmental stress limiting crop yields. The C4 cereal sorghum [Sorghum bicolor (L.) Moench] is a critical food, forage, and emerging bioenergy crop that is notably drought-tolerant. We conducted a large-scale field experiment, imposing preflowering and postflowering drought stress on 2 genotypes of sorghum across a tightly resolved time series, from plant emergence to postanthesis, resulting in a dataset of nearly 400 transcriptomes. We observed a fast and global transcriptomic response in leaf and root tissues with clear temporal patterns, including modulation of well-known drought pathways. We also identified genotypic differences in core photosynthesis and reactive oxygen species scavenging pathways, highlighting possible mechanisms of drought tolerance and of the delayed senescence, characteristic of the stay-green phenotype. Finally, we discovered a large-scale depletion in the expression of genes critical to arbuscular mycorrhizal (AM) symbiosis, with a corresponding drop in AM fungal mass in the plants' roots

Hal - Université Grenoble Alpes

eScholarship - University of California

Injury activates transient olfactory stem cell states with diverse lineage capacities

Author: Baudhuin Ariane
Choi Yoon Gi
Cole Michael B
Das Diya
Dudoit Sandrine
Fletcher Russell B
Gadye Levi
Ngai John
Purdom Elizabeth
Risso Davide
Sanchez Michael A
Street Kelly
Wagner Allon
Yosef Nir
Publication venue: Cell Press
Publication date: 01/01/2017
Field of study

Archivio istituzionale della ricerca - Università di Padova